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Abstract — We  are  developing  a technique  for  recognizing 
patterns  below  clutter  based  on  modeling  field  theory.  The 
presentation  briefly  summarizes  the  difficulties  related  to 
the  combinatorial  complexity  of  computations,  and  analyzes 
the  fundamental  limitations  of  existing  algorithms  such 
as  Multiple  Hypothesis  Testing.  A new  concept,  dynamic 
logic,  is  introduced  along  with  an  algorithm  suitable  for 
pattern  recognition  in  images  with  intense  clutter  data.  This 
new  mathematical  technique  is  inspired  by  the  analysis  of 
biological  systems,  like  the  human  brain,  which  combines 
conceptual  understanding  with  emotional  evaluation  and 
overcomes  the  combinatorial  complexity  of  model-based 
techniques.  The  presentation  provides  examples  of  object 
pattern  recognition  below  clutter. 

1.  Introduction 

The  algorithm  proposed  in  this  paper  is  developed  for  a 
specific  solution  but  has  broad  applications.  Relating  data 
to  conceptual  content  is  the  challenge  of  many  current 
methodologies  in  data  mining,  sensor\data  fusion,  signal 
processing  and  exploitation.  We  use  modeling  field  theory 
(MFT)  [1]  to  relate  data  and  object-concepts.  Just  as  the 
human  mind  relates  internal  visual  representations  of  objects 
with  retinal  signals,  MFT  associates  a moving  object-concept 
model  with  the  input  image.  Well-known  model-based 
techniques  such  as  multiple  hypothesis  testing  (MHT)  rely 
on  trying  many  combinations  of  models  and  data  to  find 
the  object  model  that  best  fits  the  data.  MHT  is  governed 
by  formal  logic,  which  starts  with  exact  statements  and 
derives  exact  conclusions.  But  the  human  mind  relates 
models  to  data  at  a pre-conscious,  fuzzy  level.  The  mind 
does  not  sift  through  every  possible  internal  visual  object- 
model,  in  every  possible  orientation,  and  in  combination 
with  other  objects  to  recognize  an  object,  such  as  a car 
on  the  road.  Rather  visual  models  are  associated  pre- 
consciously  with  retinal  signals  to  quickly  accomplish  object 
recognition.  In  a similar  manner  to  visual  object  recognition, 
MFT  associates  models  and  data  “pre-consciously”. 
The  algorithm  that  accomplishes  MFT  is  referred  to 
as  dynamic  logic  [1].  Whereas  MHT  relies  on  formal 
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logic,  MFT  relies  on  dynamic  logic,  which  starts  with 
fuzzy  statements  and  derives  exact  conclusions. 

Dynamic  logic  mitigates  combinatorial  complexity  by 
comparing  all  models  and  data  simultaneously.  Fuzzy 
models  dynamically  converge  onto  corresponding  data  using 
feedback  from  a similarity  measure.  Each  model  can  then  be 
understood  as  behaving  like  an  intelligent  agent  by  refining 
its  parameters  to  extract  maximum  object  information  from 
the  data.  Just  like  internal  visual  models  “grab”  onto  the 
corresponding  internal  retinal  signals,  internal  models  of 
stationary  and  moving  objects  converge  onto  patterns  in 
images  that  best  fit  the  models.  The  computational  cost  of  the 
dynamic  logic  algorithm  (DLA)  increases  linearly  with  the 
number  of  object  models,  while  computations  of  the  classical 
MHT  algorithm  increase  exponentially  with  additional  models. 

Problem  Statement 

A common  problem  with  object  recognition  is  the  existence 
of  overwhelming  clutter  data  in  the  image.  Figure  I displays 
the  image  of  an  object  moving  in  a straight  line  without  the 
addition  of  clutter.  When  clutter  is  added  to  the  image  in 
Figure  2,  the  object  becomes  much  more  difficult  to  observe. 
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Fig.  1.  Object  signal 

The  signal  to  clutter  ratio  (SCR)  is  an  indication  of  the 
difficulty  to  recognize  an  object  pattern  in  an  image.  The 
SCR  is  defined  as  the  ratio  of  the  maximum  absolute 
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Fig.  2.  Object  signal  + clutter 


value  of  the  object  signal  over  the  average  clutter  power. 
The  iterative  recognition  algorithm  we  present  is  able  to 
positively  determine  if  the  object  is  present  in  an  image, 
even  for  very  low  SCR  (such  as  Figure  2 with  SCR  = 2.4  dB). 

Models 

We  utilize  modeling  field  theory  (MFT)  to  develop  an  iterative 
DLA  that  is  able  to  determine  the  presence  of  an  object 
signal  in  an  image  buried  in  clutter.  A number  of  models 
are  introduced  that  account  for  the  presence  of  every  pixel  in 
the  image.  The  models  are  probability  distribution  functions: 
pdf(n|m),  for  the  signal  in  pixel  n given  it  came  from  model 
m.  Clutter  is  one  model  that  we  index  with  m = 1.  The  clutter 
model  (pdf)  is  uniformly  distributed  and  expressed  as: 

pdf(n|l)  = fj.  (1) 

The  parameter  fi  is  a single  number  that  represents  the 
estimated  proportion  of  signal  power  in  the  entire  image 
coming  from  clutter. 

The  moving  object  model  is  indexed  by  m = 2.  The  model 
(pdf)  for  all  pixels  n given  the  signal  of  a particular  pixel 
came  from  the  object  is  expressed  as: 

pdf(n|2)  = ^ f2  • G(n|xt,  crx),  (2) 

where  the  parameter  r2  represents  the  proportion  of  signal 
power  coming  from  the  object.  The  pdf  for  the  object  model 
is  computed  using  a sum  of  Gaussian  distributions  G over 
time  t = 1,  2,...,  T.  The  parameter  xt  represents  the  estimated 
position  of  the  object  over  t in  both  the  x and  y dimensions. 
For  instance,  if  the  moving  object  model  is  a straight  line,  than 
the  linear  equations, 

X = x0  + vxt, 

y = yo  + vyt,  (3) 


describe  the  object’s  position,  where  (x0,y0)  are  the  object’s 
estimated  initial  component  positions  and  (vx,vy)  are  its 
estimated  component  velocities.  The  parameter  ax  represents 
the  standard  deviation  of  the  Gaussian  distributions,  also  in 
both  the  x and  y dimensions.  The  number  of  terms  in  the 
summation  (eq.  2)  should  be  sufficient  to  accurately  model 
the  entire  moving  object  pattern. 

The  total  pdf  for  all  pixels  is  the  summations  of  the  conditional 
pdfs: 

pdf(n)  = pdf(n|l)  + pdf(n|2).  (4) 

It  is  not  required  that  the  object  pattern  be  a straight  line  as 
in  the  example  above.  The  pattern  can  exhibit  any  curvature 
or  shape  and  be  modeled  by  a non-linear  equation. 

2.  The  Dynamic  Logic  Algorithm 

The  procedure  of  the  DLA  is  summarized  as  follows  [1],  [2]: 

1)  Set  initial  value  estimates  for  the  unknown  object 
parameters  xo,yo,  vx,  vy,r2.  Also  set  initial  values  for  the 
standard  deviations  ax  and  <ry. 

In  absence  of  any  information  about  the  object,  we 
set  the  model  initial  position  in  the  center  of  the 
image  with  zero  velocity.  Referring  to  Figure  2, 
xo  = Yo  = Om,vx  = vy  = Om/s. 

The  standard  deviations  are  initialized  to  a large  value,  in- 
cluding the  entire  image,  corresponding  to  the  uncertainty  of 
knowledge: 

0-x  = <Jy  = %/n/2,  (5) 

where  N is  the  total  number  of  image  pixels. 

The  proportion  of  clutter  power  and  the  proportion  of  object 
power  are  initialized  to  be: 

fi=[\/N  - l]  / \/N, 

r2  = l-ri.  (6) 


2)  With  values  assigned  to  the  estimated  parameters,  the 
probability  distribution  functions,  eq.  (1),  (2),  can  be 
computed,  as  well  as  the  total  pdf(n). 


3)  The  probabilities  for  each  pixel  are  then  computed 
as  follows: 

p(-|i)  = Sf  ■ 


P(«|2) 


pdf(n|2) 

pdf(n) 


(7) 


This  can  be  interpreted  as  follows:  P(n|l)  is  the  probability 
that  pixel  n “belongs  to  the  clutter  model”  or  that  the  signal 
in  a particular  pixel  originates  from  clutter;  P(n|2)  is  the 
probability  pixel  n “belongs  to  the  object  model”  or  that 
the  signal  in  a particular  pixel  originates  from  the  object. 
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for  n=l,2,  ...N.  P(n|l)  + P(n|2)  — 1 for  n = l,2,  ...N, 
in  correspondence  with  the  total  probability  (for  each  pixel 
receiving  any  signal)  being  1, 

Note  that  these  probabilities  are  estimates  based  on  the 
current  object  parameter  estimates  for  a particular  iteration. 
Therefore,  these  quantities  are  not  “real”  probabilities,  they 
can  be  considered  as  subjective,  estimated  probabilities. 

4)  A similarity  measure  is  computed  that  evaluates  the 
correspondence  of  the  estimated  model  to  the  image  data 
S(n).  This  can  be  interpreted  as  mutual  information  in  models 
about  the  data  [1]: 

£ s(n)ln  [pdf(n)] 

(8) 

£S(n) 

n=l 

where  S(n)  is  the  absolute  value  of  the  complex  signal  data 
in  pixel  n. 

Dynamic  logic  maximizes  this  function  over  the  five  unknown 
parameters: 

max(L)  . (9) 

*o,y0>^x>Vy,f2 

For  the  linear  pattern  equations  (3)  above,  the  solution  to  the 
unknown  parameters  that  incrementally  maximize  L (for  each 
iteration)  is  found  analytically  by  taking  the  partial  derivative 
with  respect  to  each  parameter.  The  following  equations  are 
the  result: 

< xn  > -xo  < 1 > 4-vx  < t >=  0, 

< xnt  > -x0  < t > + vx  < t2  >=  0, 


£ R(m) 

m=l 

for  m = 1,  2. 

The  parameters,  xo,yo> vx, vy,fi,r2  are  improved  estimates 
for  the  current  iteration. 

6)  The  standard  deviations  ax  and  ay  are  reduced  by  a 
factor  between  0.93  and  0.99.  (For  the  examples  that  are  to 
follow  we  used  a factor  of  0.96.  We  did  not  yet  optimize 
the  convergence  process.  Convergence  of  this  algorithm  was 
proved  in  [1]) 

7)  The  algorithm  iterates  to  step  2.  The  process  is  continued 
until  a predefined  value  (possibly  2 or  3 pixel  resolutions)  of 
ax  and  ay  is  realized.  (For  our  examples  it  took  80  iterations 
to  reach  a minimum  required  standard  deviation  of  2 pixel 
resolutions). 

Example  1:  A Linear  Object  Pattern 

The  DLA  is  applied  to  a 101  x 101  pixel  image 
that  contains  an  object  signal  with  the  parameters: 
xo  = — 30m,  yo  = 42m,  vx  = lm/s,vy  = -0.4m/s.  The 
object  signal  alone  is  seen  in  Figure  1.  The  image  data  that 
the  DLA  operates  on  has  clutter  at  an  SCR  of  2.4  dB  (see 
Figure  2).  There  are  21  Gaussian  distributions  used  to  model 
the  object  pattern. 

Figures  3 through  7 illustrate  for  several  iterations  the  sum  of 
Gaussian  distributions  over  t in  both  dimensions  x and  y. 


< yn  > -yo  < l > + vy  < t >=  0, 

< ynt  > -yo  < t > + vy  < t2  >=  0. 


(10) 


Here  yn  and  xn  are  the  known  position  coordinates  of  each 
pixel  n.  The  parameter  t is  the  same  time  vector  used  in 
equations  (3). 

The  brackets  <>  represent  the  following  function  on  the 
known  parameters: 

£ N£1  u(n,  t)  |S(n)|  P(n|2) 

< U > = H-1  t~°N , (11) 

£|S(n)|P(n|2) 

n=l 

for  u = t,  t2,  x„,  xnt,  y„,  ynt. 


5)  Now  the  proportional  power  parameter  estimates  are 
computed  using  the  updated  pdfs  and  the  image  data: 


R<-)4s(n,ES? 


(12) 
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Fig.  3.  Iteration  1 

Figure  3 shows  the  first  iteration  where  the  initial  position 
estimate  is  at  the  center  of  the  image  with  vx  = vy  = 0, 
and  with  standard  deviations  large  enough  to  include  most 
of  the  pixels  with  non-zero  probability.  Here  the  Gaussian 
distributions  are  located  at  the  same  point  since  the  velocity 
estimates  are  zero.  In  Figure  4 we  see  crx  and  ay  are  reduced 
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Fig.  4.  Iteration  30 


Fig.  7.  Iteration  80 
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Fig.  5.  Iteration  50 
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Fig.  6.  Iteration  60 

but  the  parameters  estimates  are  still  at  the  image  center  with 
no  velocity.  As  ax  and  ay  are  further  reduced  in  Figures  5 
and  6,  the  initial  position  estimates  move  near  the  target  and 
the  velocity  estimates  take  on  some  value.  Note  the  similarity 


in  the  location  and  shape  of  the  estimated  distributions, 
Figure  7,  to  the  true  object  path  seen  in  Figure  1,  even  though 
the  algorithm  was  only  provided  the  clutter  image  in  Figure  2. 

Multiple  Object  Recognition 

Determining  the  presence  of  more  than  one  object  requires 
adding  more  models  to  the  algorithm.  Extending  the  single 
object  case,  m = 1 represents  clutter,  m = 2 represents 
one  object,  m = 3 represents  a 2nd  object,  ...  , m = M 
represents  object  M-l.  Each  object  has  its  own  parameters 
to  be  estimated:  xom,yomi  vxm,vym,fm.  The  conditional  pdf 
for  clutter  is  given  above,  equation  1,  and  the  pdfs  for  the 
remaining  object  models  are  computed  as: 

pdf(n|m)  = ^ rm  • G(n|xtm, <rx),  (14) 

for  m = 2,3...,M.  The  parameter  xtm  is  the  estimated 
position  of  object  m at  time  t,  and  rm  is  the  estimated 
proportion  of  signal  power  coming  from  object  m.  The 
total  pdf(n)  is  computed  as  pdf(n|m)  and  the  similarity 

m 

measure  L is  maximized  with  respect  to  the  unknown 
parameters  for  all  object  models. 

Example  2:  Three  Non-Linear  Object  Patterns 

The  following  is  a case  with  three  objects  in  the  image,  and 
their  motion  is  produces  a contoured  pattern.  Figure  8 displays 
the  input  image  to  the  DLA,  and  Figure  9 displays  the  object 
signals  without  clutter.  The  signal  to  clutter  ratios  for  the 
individual  objects  are,  from  the  top-right  moving  clockwise, 
-0.70  dB,  -0.73  dB,  and  -1.98  dB. 

The  distributions  operating  on  the  clutter  image  (Figure  8)  for 
the  three  object  models  are  seen  in  Figures  10  through  14. 
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Fig.  8.  Object  + clutter  signals 


Fig.  11.  Iteration  30 


Fig.  9.  Object  signals 


Fig.  12.  Iteration  40 


Fig.  10.  Iteration  1 

Fig.  13.  Iteration  70 

Again  note  the  similarity  between  the  final  result  of  the 
dynamic  logic  algorithm  estimates  (Figure  14)  and  the 
object  signals  (Figure  9).  The  object  models  were  accurately 
estimated  at  an  SCR  where  recognition  was  previously 
considered  unattainable. 
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Fig.  14.  Iteration  80 


3.  Conclusion 

The  dynamic  logic  algorithm  (DLA)  is  a novel  technique 
for  model-based  estimation  and  pattern  recognition  that  has 
proven  to  overcome  previous  limitations  of  model-based 
techniques.  For  example,  multiple  hypothesis  testing  (MHT) 
is  the  general  state-of-the-art  technique  for  model-based 
estimation  and  recognition  restricted  by  a combinatorial 
explosion  of  computations.  The  addition  of  object  models 
increases  the  number  of  computations  exponentially  with 
MHT,  whereas  a linear  increase  is  expected  using  the  DLA. 

Using  the  DLA  on  the  object  patterns  illustrated  in  this 
paper  has  yielded  an  effective  performance.  We  were  able  to 
successfully  demonstrate  an  application  of  the  DLA  to  patterns 
below  clutter,  with  only  a linear  increase  of  complexity  for 
multiple  objects. 
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